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ABSTRACT 

We present a model of truthful elicitation which generalizes 
and extends both mechanisms and scoring rules. Our main 
result is a characterization theorem using the tools of convex 
analysis, of which characterizations of mechanisms and scor- 
ing rules represent special cases. Moreover, we demonstrate 
that a variety of existing results in the mechanism design lit- 
erature are often simpler and more direct when first phrased 
in terms of convexity. We then generalize our main theorem 
to settings where agents report some alternate representa- 
tion of their private information rather than reporting it 
directly, which gives new results about both scoring rules 
and mechanism design. 



lHj: i. INTRODUCTION 



In this paper, we examine a general model of information 
elicitation where a single agent is endowed with some type 
t that is private information and is asked to reveal it. Af- 
ter doing so, he receives a score that depends on both his 
report t' and his true type t. For reasons that will become 
clear, we represent this as a function S(t')(t) that maps his 
reported type to a function that maps types to real num- 
bers, with his score being this function applied to his true 
type (equivalently his reported type selects from a param- 
eterized family of functions with the result applied to his 
true type). We allow S to be quite general, with the main 
requirement being that S(t')(-) is an affinqj function of the 
true type t, and seek to understand when it is optimal for 
the agent to truthfully report his type. Given this restric- 
tion, it is immediately clear why convexity plays a central 
role - when an agent's type is t, the score for telling the 
truth is S(t)(t) — sup t , S(t')(t), which is convex function of 
t as the pointwise supremum of affine functions. 

One special case of our model is mechanism design, where 
S(t') can be thought of as the allocation and payment given 
a report of t' , which combine to determine the utility of the 
agent as a function of his typs0. In this context, S(t)(t) 
is the consumer surplus function, and Myerson's |18| well- 
known characterization states that, in single-parameter set- 
tings, a mechanism is truthful if and only if the consumer 
surplus function is convex and its derivative (or subgradi- 
ent at points where it is not differentiable) is the allocation 

1 A mapping between two vector spaces is affine if it consists 
of a linear transformation followed by a translation. 
2 It suffices to consider a single agent because notions of 
truthfulness such as dominant strategies and Bayes-Nash 
are phrased in terms of holding the behavior of other agents 
constant. See [101 [T] for additional discussion. 



rule. More generally, this remains true in higher dimensions 
(see [T]). Note that here the restriction that S(t') is affine 
is without loss of generality, since we may consider types as 
functions mapping an outcome to the agent's utility for that 
outcome, and the evaluation of a type on an outcome (or a 
distribution over outcomes) is affine in that type. 

Another special case is a scoring rule, where an agent is 
asked to predict the distribution of a random variable and 
given a score based on the observed realization of that vari- 
able. In this setting, types are distributions over outcomes, 
and S(t')(t) is the agent's expected score for a report that 
the distribution is t' when he believes the distribution is t. 
As an expectation, this score is linear in the agent's typeQ 
Gneiting and Raftery [13! unified and generalized existing re- 
sults in the scoring rules literature by characterizing proper 
scoring rules in terms of convex functions (the negative of 
the Bayes risk from decision theory) and their subgradients. 

The similarities between these two models were noted by 
Fiat et al. [11], who gave a construction to convert mecha- 
nisms into scoring rules and vice versa. In this paper, we 
prove a general characterization, of which these characteriza- 
tions of scoring rules and mechanisms are special cases. Our 
proof is essentially a combination of Gneiting and Raftery's 
scoring rule construction [13] with a technique from Archer 
and Kleinberg [l] for handling mechanisms with non-convex 
type spaces. Our characterization not only shows how mech- 
anisms and scoring rules relate to each other, but also pro- 
vides an understanding of how results about mechanisms 
relate to results about scoring rules and vice versa. In par- 
ticular, many results in each literature are essentially results 
in convex analysis, and by phrasing them as such it is im- 
mediately clear how they apply in the other domain. 
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Figure 1: Proof structure of existing mechanism de- 
sign literature 



3 Since distributions lie on an affine space, any affine function 
can be implemented as well. 
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Figure 2: Our proof structure 

After proving our main result, we examine a number of 
characterizations from the mechanism design literature that 
generally focus on when there exist payments that make a 
given allocation rule truthful. Figure[T]illustrates these char- 
acterizations and how they were proved. As it shows, several 
of them rely on showing equivalence to a condition known as 
cyclic monotonicity. Instead, we translate these results into 
convex analysis terms and prove them by showing equiva- 
lence to the property of being a family of subgradients of 
a convex function (see Figure [2}. This has two main bene- 
fits. First, since cyclic monotonicity is a difficult condition to 
work with, we are able to greatly simplify the proofs of these 
results. Second, our proofs generally proceed by explicitly 
constructing the convex function, which gives a natural char- 
acterization of the payments rather than just showing that 
they exist. This approach also illuminates how a result by 
Carroll T about truthful mechanisms is essentially a similar 
characterization (see Theorem [4]). 

Next, we turn analyzing situations where, rather than ask- 
ing for an agent's type, we wish to elicit a simpler repre- 
sentation of it. In the scoring rules context, this has been 
studied as the elicitation of properties of a distribution such 
as the mean or median [121 [16] . In mechanism design, this 
is implicit in settings such as matching where a ranking over 
potential matches is elicited rather than the agent's utility 
for them. We generalize our main result to this setting, 
which allows us to prove several new results. Perhaps the 
most notable is that if the set of distributions has positive 
measure in its convex hull then then there is a unique value 
of the property almost everywhere. 

In addition to our more significant results, we get a num- 
ber of smaller new results along the way essentially for free. 
These include: 

• the fact that in mechanisms which select among a fi- 
nite set of (allocation, payment) pairs, the set of sets of 
types that select each outcome forms not just a set of 
polyhedra but a power diagram; 

• a relaxation of an outcome compactness assumption 
needed by Archer and Kleinberg [T| for allocation rules 
on non-convex type spaces; 

• a characterization of proper scoring rules for non-convex 
sets of distributions; and 

• a characterization that a scoring rule is proper if and 
only if it is locally proper 



Notation.. 

We define M = M U { — oo, oo} to be the extended real 
numbers. Given a set of measures M on X, a function / : 
X — > R is Af-quasi-integrable if J x f(x)d(j,(x) G R for all 
/i g M. Let A(X) be the set of all probability measures 
on X. . We denote by Aff(X -> Y) and Lin(X -> Y) the 
set of functions from X to Y which are affine and linear, 
respectively. We write Conv(A) to denote the convex hull of 
vector space X, the set of all (finite) convex combinations 
of elements X. 

2. MODEL AND MAIN RESULT 

We consider a very general model with an agent who has 
a given type t G T and reports some possibly distinct type 
t' G T, at which point the agent is rewarded according to 
some score S which is affine in the true type t. This reward 
we call an affine score. We wish to characterize all truthful 
affine scores, those which incentivize the agent to report her 
true type t. 

Definition 1. Any function S : T — ¥ A, where T C V 
for some vector space V and A C Aff(T — > R), is a affine 
score. We say S is truthful if for all t,t' G T, 

(1) 

'■ t, t' , then S is strictly truth- 



S(t')(t) < S(t)(t). 



If this inequality is strict for i 
ful. 



Before stating our characterization result, we note two 
important applications of, and indeed motivations for, this 
framework: mechanism design, and proper scoring rules. 

Definition 2. Given outcome space O and a type space 
TC [O — ¥ M), consisting of functions mapping outcomes to 
reals, a mechanism is a pair (f,p) where f : T —¥ O is an 
allocation rule and p : T — > M is a payment. The utility 
of the agent with type t and report t' to the mechanism is 
U(t',t) = t(f(t')) — pit'); we say the mechanism (f,p) is 
truthful ifU(t',t) < U(t,t) for allt,t' G T. 

Definition 3. Given outcome space O and set of prob- 
ability measures V C A(O), a scoring rule is a function 
S:PxO->I. We say S is proper if for all p,q eV, 

E [S(q,o)] < E [S(p,o)]. (2) 

If the inequality in @ is strict, then S is strictly proper. 

These two models fit comfortable within our framework. 
For the mechanism, viewing T as a vector space of functions, 
the term t(f(t')) is linear in t. Hence, {/(-, t') is affine. Thus, 
the mechanism is simply the affine score with A — {t H» 
t(o) + c| o G 0,c G R} defined by S(t')(t) := U(t,t'). Simi- 
larly, a scoring rule S is an affine score S : T — > A with T = 
V, and A C {p i— > j f(o)dp(o) | / is P-quasi-integrable}. 

Our characterization relies heavily on convex analysis, a 
central concept of which is the subgradient of a function. 

Definition 4. Given some function G : T — > R, a func- 
tion dGt G Lin(T — y R) is a subgradient to G at t if for all 
f G T, 



G(t') > G(t) + dG t (t' -t). 



(3) 



We denote by dGt the set of subgradients to G at t, and 
dG = U teT dG t . 



For mechanism design, it is typical to assume that utilities 
are always real valued. However, the log scoring rule (one 
of the most popular scoring rules) has the property that if 
an agent reports that an outcome will occur with probabil- 
ity and then that outcome does occur, the agent receives 
a score of -co. Essentially solely to accommodate this, we 
allow affine scores and subgradients to take on values from 
the extended reals. It is standard (see 13.) to restrict con- 
sideration to the regular case, where essentially only things 
like the log score are permitted to be infinite. In particu- 
lar, an afline score S is regular if for all t G T S(t)(t) G R 
and for t' =£t S(t')(t) 6lU {-co}. Similarly, a parameter- 
ized family of linear functions (e.g. a family of subgradients) 
{dG t G Lin(T ->■ R)}ter is regular if for all t G T dG t {t) G R 
and for t' ^ t dG t / (t) G RU{ — oo}. For the remaineder of the 
paper we assume all affine scores and parameterized families 
of linear functions are regular. Note that certain results in 
Section [3] require a stronger assumption that the relevant 
parameterized families are in fact real valued rather than 
simply regular. A reader not interested in the details of how 
our framework incorporates the log scoring rule can assume 
that all affine scores and families are real valued throughout 
the paper with little loss. 

We now state, and prove, our characterization theorem. 
Note that the proof draws techniques and insights from 
Gneiting and Raftery [13] and Archer and Kleinberg [T], but 
in such a way as to simplify the argument considerably. 

Theorem 1. Let an affine score S : T — > A be given. 
S is truthful if and only if there exists some convex G : 
Conv(T) —5- R with G(T) C R, and some selection of sub- 
gradients {dGt}teT, such that 

S(t')(t) = G(t') + dG t ,(t-t'). (4) 

Proof. It is trivial from the subgradient inequality <(3j 
that the proposed form is in fact truthful. For the converse, 
we are given some truthful S : T — > A. Note first that for 
any t G Conv(T) we may write t as a finite convex combi- 
nation t — Y^rLi a iti where U G T. Now, as the range of S 
is affine, we may naturally extend S(t) to all of Conv(T) by 
defining 

m 

S(t)(t) =^a i S(t)(t i ). (5) 

i=l 

One easily checks that this definition coincides with the 
given S on T. 

Now we let G(t) := sup (g7 - S(t)(t), which is convex as the 
pointwise supremum of convex (in our case affine) functions. 
Since S is truthful, we in particular have G(t) — S(t)(t) G R 
for all t G T by our regularity assumption. Also by truth- 
fulness, we have for all t, t' G T, 

G(t) = S(t)(t) > S(t')(t) = S(t')(t') + Si(t')(t - t') 
= G(t') + S t {t')(t-t'), 

where Se{t') is the linear part of S(t'). Hence, Se(t') satis- 
fies © for G at t! and so S is of the form (0} . □ 

3. CONVEXITY IN MECHANISM DESIGN 

Interpreted in the general mechanism design framework 
given by Definition [2] Theorem [1] says that a mechanism 
(/, p) is truthful if and only if the consumer surplus func- 
tion 1 1 — y U(t,t) that it implicitly defines on the convex hull 



of T is a convex function which has a subgradient consistent 
with / on T. This is a known characterization for the case of 
convex T (as well as non-convex T that satisfy an assump- 
tion known as outcome compactness) [1] , but in practice the 
consumer surplus function is not always the most natural 
representation of a mechanism. In this section, we exam- 
ine two other approaches to characterizing truthful mecha- 
nisms that have been explored in the literature and show 
that they have insightful interpretations in convex analysis. 
This interpretation has two benefits. First, by focusing on 
the essential convex analysis questions we are able to greatly 
simplify many of the proofs. Second, our proofs are con- 
structive; in many cases we explicitly construct a consumer 
surplus function G, which when the mechanism is being rep- 
resented by its allocation rule gives the necessary payments 
rather than simply providing a proof that payments exist. 

3.1 Subgradient characterizations 

From an algorithmic perspective, it may be more natural 
to focus on the design of the allocation rule / rather than the 
specific payments. There is a large literature that focuses 
on when there exists a choice of payments p to make / into 
a truthful mechanism (e.g. [241 [2]). Since such payments 
exist if and only if there is a convex function for which / is a 
subgradient at points in T, this is essentially a very natural 
convex analysis question: when is a function / a subgradient 
of a convex function? Unsurprisingly, the central result in 
this literature is closely connected to convex analysis. 

Definition 5. A family {dG t G Lin(T — > R)}ter satis- 
fies cyclic monotonicity (CMON) if for all finite sets {to, . . . ,tk} C 
T, 

k 

Y, dG ti(ti+i-ti)<0, (6) 

i=0 

where indices are taken modulo k + 1. We refer to the weaker 
condition that ([6]) hold for all pairs {to,ti} as weak mono- 
tonicity (WMON). 

A well known characterization from convex analysis is that 
a function / defined on a convex set is a subgradient of 
a convex function on that set iff it satisfies CMON [22] . 
Rochet's [21] proof that a such payments exists on a possibly 
non-convex T iff / satisfies CMON is effectively a proof of 
the following generalization of this theorem. 

Theorem 2. A family {dGt G Lin(T —5- R)}ter satisfies 
CMON if and only if there exists a convex G : Conv(T) — ► R 
such that dGt is a subgradient of G at t for all t G T. 

Rochet notes that his proof is adapted from the one given 
in Rockafellar's text [22] of the weaker theorem where T is 
restricted to be convex. We adapt Rochet's proof to high- 
light how its core is a construction of G. 

Proof. Given such a G, by ^ we have dG ti {U+i —U)< 
G(ti+i) — G(ti). Summing gives ([6]). Given such a family 
{dGt}teT, fix some to G T and define 

k 

G{t)= sup 5ZdGt 4 (*i+x-t<), (7) 

{tl,---,tfc+l}CT, j_Q 

where {ti, . . . , t^} denotes any finite sequence (k is not fixed). 



By CMON, for t £ T this sum is upper bounded by 
—dGt(to — t). Thus, the supremum is finite on T. G is 
a pointwise supremum of convex functions, so is convex. By 
convexity, G is also finite on Conv(T). 

For any t £ T and t' £ Conv(T), 

k 

G(t) + dG t (t' -t)=dG t {t' -t)+ sup VdG ti (i i+1 - 

{t 1 ,...,t k + 1 }CT, j=0 
& 

= sup rfGt t (ti+i — u) 

.,* fc+ i}cr , l=0 

fe 

< sup dG ti (tj+i — tj) 

{*!,.. ,i l+1 }cr, l=0 

= G(0, 
so ciGt satisfies (|3])- d 

A number of papers have sought simpler and more natu- 
ral conditions than CMON that are necessary and sufficient 
in special cases, e.g. [241 HI [2] . These results are typically 
proven by showing they are equivalent to CMON. However, 
it is much more natural to directly construct the relevant G. 
This also often has the advantage of providing a characteri- 
zation of the payments that is more intuitive than the supre- 
mum in Rochet's construction. As an example, we show one 
such result has a simple proof using our framework. 

As in Myerson's [18] construction for the single-parameter 
case, we construct a G by integrating over dGt- In partic- 
ular, for any two types x and y our construction makes use 
of the line integral 

/ dG t {y-x)dt= dG (1 _ t)x+ty (y - x)dt. 

J Lxy JO 

As Berger et al. J), and Ashlagi et al. [2] observed, if {dGt}teT 
satisfies WMON and T is convex, this (Riemann) integral is 
well defined because it is the integral of a monotone function. 
If these line integrals vanish around all triangles (equiva- 
lentl y dG t(y- x ) dt + I Ly ; dG t (z-y)dt = J LmM dG t (z- 
x)dt)) we say {dGt} satisfies path independence. 

Theorem 3 (adapted from [T7] ) . For convex T, a fam- 
ily {dGt £ Lin(T — > R)}tgr is a subgradient of a convex 
function if and only if {dGt}teT satisfies WMON and path 
independence. 

Proof. Given a convex function G and {dGt}, {dGt} 
satisfies CMON and thus WMON. Path independence also 
follows from convexity (Rockafellar 22 p. 232). Now given 
a {dGt} that satisfies WMON and path independence, fix 
a type to € T and define G(t') = L dG t (t' - t )dt (well 

defined by WMON as the integral of a monotone function). 
Given x,y,z £ T such that z — \x + (1 — \)y, by path 
independence and the linearity of dG z we have 

\G{x) + (1 - A)G(y) 

= G(z) + A f dG t (x- z)dt + (1 - A) f dG t {y - z)dt 
> G(z) + \dG z {x - z) + (1 - X)dG z (y - z) = G(z), 



so G is convex. Similarly, for x, y 6 T, dGt satisfies ([3]) 
because 

dG x (y-x)< f dG t (y-x)dt = G(y)-G(x). □ 

J L x y 

3.2 Local convexity 

ti) In many settings, it is natural to specify mechanisms in 
terms of an algorithm that computes the allocation and pay- 
ment. As it is often easier to reason about the behavior of 
algorithms given small changes to their input rather than ar- 
bitrary changes, several authors have sought to characterize 
truthful mechanisms using local conditions [TJG3G]- 

We show in this section how many of these results are 
in essence a consequence of a more fundamental statement, 
that convexity is an inherently local property. For example, 
in the twice differentiable case it can be verified by deter- 
mining whether the Hessian is positive semidefinite at each 
point. We start with a local convexity result, and use it to 
show that an affine score is truthful if and only if it satisfies 
a very weak local truthfulness property introduced by Car- 
roll [7]. Afterwards we turn to a similar characterization by 
Archer and Kleinberg [I]. 

Definition 6. Given a function G , a family {dGt £ Lin(T— > 
R)}t S r * s a weak local subgradient with respect to G (G- 
WLSG) if for allt € T there exists an open neighborhood Ut 
of t such that for all t' £ Ut, 

G{t) > G{t')+dG t >{t-t') and G(t') > G(t)+dG t (t'-i). 

(8) 

We now show that G-WLSG is a sufficient condition for 
a family of functions to be a subgradient of G. The proof is 
heavily inspired by Carroll [7J. 

Theorem 4. Let T be convex. A family {dGt £ Lin(T — > 
RjtgT is a subgradient of a given function G if and only if 
it satisfies G-WLSG. 

Proof (Adapted from [7]). As usual, the forward di- 
rection is trivial. For the other, let t, t' £ T be given; 
we show that the subgradient inequality for dG t > holds at 
t. By compactness of Conv({£, t'}), we have a finite set 
ti = ait' + (1 - oti)t, where = cto < ■■■ < a k +i = 1, 
such that G-WLSG holds between each ti and ii+i. (The 
cover {Us | s £ Conv({t, t'}) has a finite subcover. Take t2i 
from the subcover and fei+i £ Ut 2i n Ut 2i+2 •) By the WLSG 
condition we have for each i, 

> G(U +1 ) - G(U) + dG H+1 {ti - t»+i) (9) 
> G(U) - G(t i+1 ) + dG u (U+i - U). (10) 

Now using the identity ti+i — ti = (Qi+i — cti)(t' — t) and 
adding ai/(a i+ i-ai) times © to a i+ i / '(a i+1 -cti) times JT0), 
we have 

> G(ti)-G{ti + i)+aidG ti (t'-t)-a i+ idG ti+1 (t'-t). (11) 
Summing (111[) over < i < k gives 

> G(t ) - G(t fe+ i) + Q rfGt (t' -t)- a k+1 dG tk+1 (*' - t), 

which when recalling our definitions for cti and ti yields the 
result. □ 

The WLSG condition translates to an analogous notion in 
terms of truthfulness, weak local truthfulness. 



Definition 7. An affine score is weakly locally truthful 
if for all t £ T there exists some open neighborhood Ut of 
t, such that truthfulness holds between t and every t' G U t , 
and vice versa. That is, 

Vt G T, Vi' G U t , S(t')(t) < S(t)(t) and S(t)(t') < S(t')(t'). 

(12) 

Corollary 5 (Generalization of Carroll |7j). An 
affine score S : T — » A for convex T is truthful if and only 
if it is weakly locally truthful. 

PROOF. Defining G(t) := S(t)(t), by weak local truthful- 
ness we may write 

Git) = S(t)(t) > S(t')(t) = G(t') + S e (t')(t - t') 
G(t') = S(t')(t') > S(t)(t') = G(t) + S e (t)(t' - t), 

where Si(-) is the linear part of S(-). This says that dGt = 
Se(t) satisfies G-WLSG; the rest follows from Theorem [4] 
and Theorem [T] □ 

Finally, in the spirit of Section 13.11 Archer and Klein- 
berg p] characterized local conditions under which an allo- 
cation rule can be made truthful. A key condition from their 
paper is vortex-freeness, which is a condition they show to be 
equivalent to local path independence. The other condition, 
local WMON, means that WMON holds in some neighbor- 
hood around each type. Their result follows directly from 
an analogous characterization of subgradients. 

Corollary 6. LetT be convex. A family {dG t G Lin(T — > 
R)}*eT is o, subgradient of a convex function if and only if 
it satisfies local WMON and is vortex-free. 

Proof. We prove the reverse direction; suppose {dGt}tcT 
satisfies local WMON and is vortex-free. From Lemma 3.5 
of [I] we have that vortex-freeness is equivalent to path inde- 
pendence, so by Theorem [3] for all t there exists some open 
Ut such that {dG t '}t'eu t i s the subgradient of some convex 
function : Ut — > R- We need only show the existence of 
some G such that {dGt}teT is the subgradient of G on each 
Ut; the rest follows from Theorem [4] 

Fix some to G T and define G(t) = f L t dG t >dt' , which 

is well defined by compactness of Conv({io,£}) and the fact 
that a locally increasing real-valued function is increasing. 
But for each t' and t G U t > we can also write G'* '(t) = 
J L dG t ndt" by |22l p. 232], and now by path indepen- 
dence we see that G and G'* ' differ by a constant. Hence 
{dGt}teT must be a subgradient of G on U t > as well, for all 
t' G T. □ 

4. EXTENSION TO PROPERTIES 

In many settings, it is difficult, or even impossible, to 
have agents report an entire type t € T. For example, when 
= R, then a mechanism with T = (0 — > R) requires 
agents to submit an infinite-dimensional type, and even type 
spaces which are exponential in size can be problematic. It 
is therefore natural to consider a model of truthful report- 
ing where agents provide some sort of summary information 
about their type. Such a model has been studied in the scor- 
ing rules literature, where one wishes to elicit some statistic, 
or property, of a distribution |12l 116] . We follow this line of 
research, and extend our affine score framework to accept 



reports from a different (intuitively, smaller) space than T. 
Our results shed light on the structure of properties and 
their affine scores. 

4.1 General setting 

We wish to generalize the notion of an affine score to ac- 
cept reports from a space R which is different from T. To 
even discuss truthfulness in this setting, we need a notion of 
a truthful report r for a given type t. We encapsulate this 
notion by a general multivalued function which specifies only 
and all of the correct values for t. 

Definition 8. Let R be some given report space. A prop- 
erty is a multivalued map T : T =4 R which associates a 
nonempty set of correct report values to each type. We let 
r r := {t € T\r G r(t)} denote the set of types t correspond- 
ing to report value r. 

One can think of r r as the "level set" of V corresponding to 
value r. This concept will be useful mainly when we consider 
finite- valued properties in Section T4. 21 

We extend the notion of an affine score to this setting, 
where the report space is R instead of T itself. 

Definition 9. An affine score S : R — ► A elicits a prop- 
erty r : T =t R if for all t, 

r(t) = argsup S(r)(t). (13) 

A property T : T =X R is elicitable if there exists some affine 
score S : R — > A which elicits T. 

As before, we need a notion of regularity here as well - 
an affine score S is F-reguJar if S(r)(t) < oo always and 
S(r)(t) G R whenever r 6 T(t). 

The simplest way to come up with an elicitable property 
is to induce one from an affine score. For any S : R — >• 
Aff(T -> R), the property 

F s : t -> argsup 5(r)(t) (14) 

rER 

is trivially elicited by S. 

Observe also that any affine score S eliciting F gives rise 
to a truthful affine score in the original sense - in fact, this 
is a version of the so-called revelation principle. For each t 
let r t G r(t) be a report choice for t; then the affine score 
S T (t')(t) := S(r t r)(t) is truthful. Moreover, by our choices 
of {rt}, we have 

G(t) := sup S T (t')(t) = sup S(r)(t). (15) 
t'er re-R 

Of course, in general, S 7 ' will not be strictly truthful, since 
by definition, any reports t' ,t" with ry = r t " (which can 
happen when t',t" G F r for some r) will have S T (t') = 
S T (t"). Thus we may think of a property as refining the 
notion of strictness for a truthful affine score. The connec- 
tion we will draw in Theorem [S] is that, in light of (|15p . a 
property F therefore specifies the portions of the domain of 
T where G must be "flat". 

To get at the connection between properties and "flatness", 
we start with a technical lemma which shows that having the 
same subgradient at two different points implies that G is 
flat in between. 

Lemma 7. For convex T , let G : T — > R be convex, and 
let d G dGt for some t G T . Then for all t' G T, 

dedG t , G(t)-G(t') = d(t-t'). 



Proof. First, the forward direction. Applying the sub- 
gradient inequality ([3]) at t' for dGt = d and at t for dG t i = 
d, we have 

G(t') > G(t) + d(t' - t) 

G{t) >G(t')+d(t-t'), 

from which the result follows. For the converse, assume 
G{t) = G(t') + d(t - t') and let t" € T be arbitrary. Then 
since d G dGt we have 

G(t') + d(t" - t') = G(i') + d(t" -t) + d(t - t') 

= G(i) + d(t" - i) 

< G(i"), □ 

We are now ready to state our characterization in this 
setting, which in essence says that eliciting a property F is 
equivalent to eliciting subgradients of a convex function G. 

Theorem 8. Let property Y : T =S R and Y -regular affine 
score S : R — » A be given. Then S elicits Y if and only if 
there exists some convex G : Conv(T) — ¥ R with G(T) C R 
and ip : R — > dG satisfying r G Y(t) •<=>■ <p(r) £ dGt, such 
that 

S{r)(t) = G(t r ) + <p(r){t-t r ), (16) 
where {t r } r eR satisfies r G Y(t r ) for all r. 

Proof. For the forward direction, assume that affine score 
S elicits r. For each r, we may extend S(r) to all t G 
Conv(T) by linearity as in Theorem [T] whence we may de- 
fine G(t) := sup re ^ S(r)(t), which is finite for i G T as S is 
F-regular. We wish to show that the choice ip : r i— > Se(r) 
suffices, where Si denotes the linear part of 5*. 

By standard arguments (cf. Proposition 8.12 of 23.), the 
subgradients to a convex function F:l->Rati€l are 
precisely the gradients of its affine supports at x. More for- 
mally, let AM(J ? ) = {a G Aff(X -»■ R) | Vz F(a;) > o(x)} be 
the set of affine functions minorizing F, and let AS(F, x) = 
{a G AM(F) | aix) = -^(a;)} be the set of affine supports of 
F at x. Then we have 8F X = {da\a£ AS(F, x)}. 

By definition of G, we have S(r) G AM(G). Noting that 
dS(r)t ~ Se(r), we have for all t G T and r £ R, 

r G r(t) ^ = sup 5(r')(*) 

<=> S(r)(f) = G(t) 

G AS(G,t) 
5/(r) G 9G t , 

where we use elicitation in the first biconditional. Finally, 
using T- regularity and the definition of t r , we show the 
form 

G(tr) + S e {r)(t - U) = S(r)(t r ) + S e (r){t - tr) = S(r)(t). 

For the converse, let G, 99, and t r be given, and assume 
S has the form (|16|l . First note that by definition of {t r }, 
and by assumption on ip, we have v 3 ! 7 ") G <9Gt r for all r. We 
claim that G dominates 5, meaning for all r G R and all 
£ G T, G(t) > S(r)(t); this follows from the definition of S 
and the subgradient inequality (0 applied to y>(r) at t. 

Now by Lemma (7) we have 

<p(r) G 9Gt ^> G(t) - G(t r ) = ^(r-)(t - U) 

G(t) = 5(r)(t). (17) 



By definition of a property, we have for all t G T that T(f) ^ 
0, so by the above G(t) = S(r)(t) for some r. Combining 
this with the fact that G dominates 5, we have G(t) — 
sup r S(r)(t) for all t G T. Putting this together with ((17)1 
and our assumption that r G F(t) <^( r ) £ 9Gt, we 

have 

rer(t) ^ G(t) = 5(r)(t) ^ r G argsup S(r')(t) □ 

Returning to the above discussion, by focusing on S T in- 
stead of S, we now see how properties are essentially a re- 
finement of strictness. In fact, one can can formalize this 
by considering that, up to a remapping of R, S T is strictly 
truthful if and only if S elicits Y : t {i}, and S T is truth- 
ful if and only if S elicits some F such that t G Y(t) for all T. 
Hence, Theorem [5] is actually a generalization of our main 
result, Theorem [1] 

Returning our focus to R, our characterization sheds light 
on the structure of elicitable properties. For example, we 
may apply the fact that convex functions are differentiable 
almost everywhere to yield a very strong result about the 
structure of elicitable properties. 

Corollary 9. If T : T =4 R is an elicitable property, T 
is of positive measure in Conv(T), and no property value is 
redundant (Wi,r2 G R,Y ri % r r2 ) then |r(i)| = 1 almost 
everywhere. 

4.2 Finite case 

We now turn to the case where 7? is finite. In this scoring 
rules literature, this case has been thought of as eliciting 
answers to multiple-choice questions [TS]. There are also 
natural applications to mechanism design, which we mention 
below. 

Assume throughout that R is finite and that T is a subset 
of a vector space V endowed with an inner product, so that 
we may write (t, t') and in particular \\t\\ 2 — (t,t). In this 
more geometrical setting, we will use the concept of a power 
diagram from computational geometry. 

Definition 10. Given points pi G V, called sites, and 
weights w(pi) G R for i G {1, . . . , m}, a power diagram is a 
collection cells cell(pi) C T defined by 

cell(p;) = {t G T\ i = argmin{|]pi - t\\' 2 — w(j)j)}. (18) 

j 

The following result is a generalization of Theorem 4.1 
of [15], and is essentially a direct application of Proposition 
1 from [3]. 

Theorem 10. A property Y : T =4 R for R finite is elic- 
itable if and only if the level sets {Y r } r £H form a power 
diagram. Moreover, every S : R A. can be written 

S(r)(t) = \\t\\ 2 -\\pr-tf+w(p r ), (19) 

for some choice of sites {p r G V}r^R and weights w(p r ) G R. 

Proof. Given sites {p r } and weights w(p r ), let S take 
the form (|19p . Then taking Y(t) := argsup r S(r)(t), we see 
r r = cell(p r ). 

Given affine score S eliciting Y, note that since we are in 
an inner product space, we may write S(r)(t) = (x r , t) + c r 
for x r G V and c r G R. Now take p r = x r /2 and w(p r ) — 
\\p r \\ 2 + c r ; then 

\\tf-\\pT-t\\ 2 +w(p T ) = 2 { P r,t)-\\pr\\ 2 +c r +\\ Pr \\ 2 = S(r) (t), 



which simultaneously proves the form ()19jl and that r r = 
cell(p r ). □ 

We can now combine Theorem \W\ and Corollary [5] to ob- 
tain the following simple truthfulness check for the finite 
case. 

Theorem 11. For finite R, an affine score S : R ^ A 
elicits r if and only if for all r, r' such that F r n T r i 7^ 0, we 
have S(r')(t) < S(r)(t) for all t 6 T r \ F r >. 

We now apply our results to mechanism design with a 
finite set of allocations. One example of such a setting is 
ordinal utilities over a finite set of outcomes. Let T = O — > 
R and let R be the set of preference orderings over O, which 
we represent as permutations tv of O (higher preference being 
first). Then a natural property is 

T(i) = {tt 6 R I Vi < 3, t(m) < tfc)}, (20) 

the set of rankings consistent with type t. More generally, 
truncated rankings or any mechanism with a finite message 
space yields such a property. 

Carroll [7] points out that it is known that for all finite sets 
of allocations, the set of types for which a particular alloca- 
tion is optimal forms a polyhedron and then shows that for 
all convex set of types it is sufficient to check incentive com- 
patibility constraints between polyhedra that intersect at a 
face. Our results in this section show something stronger: 
the types form not just an arbitrary set of polyhedra, but 
a power diagram. However, the sufficiency condition we get 
is slightly weaker, as it requires checking incentive compat- 
ibility between types that intersect at a vertex as well as at 
a face. 

5. ADDITIONAL APPLICATIONS 

In this section, we demonstrate the power of our frame- 
work with several additional applications drawn from the 
scoring rules literature. 

5.1 Two Results About Scoring Rules 

The role of convexity in the scoring rules literature dates 
back to 1971 when Savage showed how to construct a scor- 
ing rule from a convex function |25| . The general, mod- 
ern characterization of scoring rules is due to Gneiting and 
Raftery [13], who used a more nuanced convex analysis ap- 
proach to clarify and generalize a number of previous charac- 
terizations, including Savage [25] and Schervish [26]. In this 
section, we show that the Gneiting and Raftery characteri- 
zation is a simple special case of Theorem [1] and moreover 
that our results enable several generalizations in the scoring 
rule setting. 

Letting T be the set of 'P-quasi-integrable functions / : 
O — > R, we apply our characterization with T = V and 

A = { P ^ J f(o)d P (o)\f eT}. 

COROLLARY 12. For an arbitrary set V C A(O) of prob- 
ability measures, a regula^\ scoring rule S : V x O — > R 
is proper if and only if there exists a convex function G : 
Conv('P) — i> R with functions G p £ T such that 

S(p,o) = G(p) + G p (o)- [ G p (o)dp(o), (21) 
Jo 

4 This is the same concept as with affine scores: scores cannot 
be 00 and only incorrect reports can yield —00. 



where G p : q t-¥ f a G p (o) dq(o) is a subgradient of G for all 
peV. 

Proof. The given form is truthful by the subgradient in- 
equality. Let S : T — > A be a given truthful affine score. 
Since S(p) £ A, we have some f p G T generating S(p). We 
can therefore use G p : q 1-} JqJ p {o) dq{6) as the subgradi- 
ents in the proof of Theorem \T\ thus giving us the desired 
form. □ 

Corollary [12] immediately generalizes their characteriza- 
tion to the case where V is not convex. While this case is 
not of central importance to the theory of scoring rules, it is 
sometimes a natural case to consider. For example, Babaioff 
et al. 4 examine when proper scoring rules can have the 
additional property that uninformed experts do not wish to 
make a report (have a negative expected utility), while in- 
formed experts do wish to make one. They show that this 
is possible in some settings where the space of reports is not 
convex. Thus our characterization shows that, despite not 
needing to ensure properness on on reports outside V, es- 
sentially the only possible scoring rules are still those that 
are proper on all of A(0). 

While local truthfulness is a natural property to exam- 
ine in mechanism design, it is not obvious how it is as use- 
ful for scoring rules since scoring rule designers tend to op- 
erate under significantly fewer constraints than mechanism 
designers. Nevertheless, to results from Section 13.21 apply, 
so Corollary [S] shows that local properness is equivalent to 
global properness for scoring rules on convex V. 

Corollary 13. For a convex set V C A(O) of probabil- 
ity measures, a scoring rule S : V x O —¥ R is proper if and 
only if it is (weakly) locally proper. 

5.2 Decision Rules 

Theorem [1] also generalizes Gneiting and Raftery's [13] 
characterization to settings beyond eliciting a single distri- 
bution. For example, a line of work has considered a setting 
where a decision maker needs to select from a finite set T> of 
decisions and so desires to elicit the distribution over out- 
comes conditional on selecting each alternative |20l [9] [8]. 
Since only one decision will be made and so only one condi- 
tional distribution can be sampled, simply applying a stan- 
dard proper scoring rule generally does not result in truthful 
behavior. Applying Theorem [T] to this setting characterizes 
what expected scores must be, from which many of the re- 
sults in these papers follow. As it is not our main focus, we 
refrain from introducing the model necessary to explicitly 
state a characterization result similar to Corollary 1121 

6. DISCUSSION 

We have presented a model of truthful elicitation which 
generalizes and extends both mechanisms and scoring rules. 
On the mechanism design side, we have seen how our frame- 
work provides simpler and more constructive proofs of a 
number of known results, some of which (as we saw in Sec- 
tion 15. 1[) lead to new results about scoring rules. We then 
generalized our model to allow reports of a succinct rep- 
resentation of an agent's private information rather than 
that information itself, a topic that has been studied in the 
scoring rules literature about eliciting properties of distribu- 
tions. This led us to new results in both mechanism design 
and scoring rules. 



Our analysis makes use of the fact that S(t')(t) is afhne in 
t to ensure that G(t) = sup t / S(t')(t) is a convex function. 
However, this property continues to hold if S(t')(t) is instead 
a convex function of t. Thus, a natural direction for fu- 
ture work is to investigate characterizations of convex scores. 
While mechanisms can always be represented as afhne func- 
tions by taking the types to be functions from allocations to 
R, it may be more natural to treat the type as a parameter 
of a (convex) utility function. While many such utility func- 
tions are affine (e.g. dot-product valuations), others such 
as Cobb-Douglas functions are not. Berger, Miiller, and 
Naeemi [5] [5] have investigated such functions and given 
characterizations that suggest a more general result is pos- 
sible. Another potential application is scoring rules for alter- 
nate representations of uncertainty, several of which result 
in a decision maker optimizing a convex function [14] . 

In one sense getting such a characterization is straight- 
forward. In the afhne case we want S(t')(t) to be an afhne 
function such that S(t')(t) < G(t) and S(t')(t') = G(t'). 
Since we have fixed its value at a point, the only freedom we 
have is in the linear part of the function, and being a sub- 
gradient is exactly the definition of such a linear function. 
So while our characterization of affine scores is in some sense 
vacuous, it is also powerful in that it allows us to make use of 
the tools of convex analysis. A similarly vacuous character- 
ization is possible for the convex case: S(t')(t) is a convex 
function such that S(t')(t) < G(t) and S(t')(t') = G(t'). 
The challenge is to find a way to state it that is useful and 
naturally handles constraints such as those imposed by the 
form of a utility function. 

Theorem [10] shows that scoring rules for finite properties 
are essentially equivalent the weights and points that induce 
a power diagram. So the natural open problem of charac- 
terizing all scoring rules for a given finite property in some 
effective fashion is equivalent to the problem of character- 
izing all weights and points that induce a particular power 
diagram. We are uncertain about whether this problem has 
been studied, but it is a natural question in that context 
as well. As power diagrams are known to be connected to 
the spines of amoebas in algebraic geometry, tropical hyper- 
surfaces in tropical geometry, and aspects of toric geometry 
used by string theorists [27] there may be interesting corre- 
sponding characterization questions in those fields as well. 

While our examples have focused on mechanism design 
and scoring rules, another interesting direction to pursue is 
other settings where our results may be applicable. One nat- 
ural domain that is closely related to scoring rules for prop- 
erties is the literature on M-estimators in machine learning, 
statistics and economics. Essentially, this literature takes a 
loss function (i.e. a scoring rule) and asks what property it 
elicits. For example, the mean is an M-estimator induced 
by the squared error loss function. Some work in this litera- 
ture (e.g. [19]) requires that the loss function satisfy certain 
properties, and our results may be useful in characterizing 
and supplying such loss functions. 
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